TESE: generating specific protein structure test set ensembles
نویسندگان
چکیده
UNLABELLED TESE is a web server for the generation of test sets of protein sequences and structures fulfilling a number of different criteria. At least three different use cases can be envisaged: (i) benchmarking of novel methods; (ii) test sets tailored for special needs and (iii) extending available datasets. The CATH structure classification is used to control structural/sequence redundancy and a variety of structural quality parameters can be used to interactively select protein subsets with specific characteristics, e.g. all X-ray structures of alpha-helical repeat proteins with more than 120 residues and resolution <2.0 A. The output includes FASTA-formatted sequences, PDB files and a clickable HTML index file containing images of the selected proteins. Multiple subsets for cross-validation are also supported. AVAILABILITY The TESE server is available for non-commercial use at URL: http://protein.bio.unipd.it/tese/.
منابع مشابه
P-116: Absence of JMJD1A, A Testis- Specific Histone Demethylase, in Tissue Samples of TESE Negative Men
Background During mammalian spermatogenesis unique and dynamic epigenetic events occur leading to chromatin condensation. Through these events, histone demethylases such as JMJD1A play important roles in compaction of sperm chromatin, due to regulation of histone methylation dynamics and alteration of chromatin structure. As �histone methylation� is one of the best-characterized modifications i...
متن کاملImproving Boosting Methods by Generating Specific Training and Validation Sets
In previous researches it can been seen that Bagging, Boosting and Cross-Validation Committee can provide good performance separately. In this paper, Boosting methods are mixed with Bagging and Cross-Validation Committee in order to generate accurate ensembles and take benefit from all these alternatives. In this way, the networks are trained according to the boosting methods but the specific t...
متن کاملEnsembles for Time Series Forecasting
This paper describes a new type of ensembles that aims at improving the predictive performance of these approaches in time series forecasting. Ensembles are recognised as one of the most successful approaches to prediction tasks. Previous theoretical studies of ensembles have shown that one of the key reasons for this performance is diversity among ensemble members. Several methods exist to gen...
متن کاملFunnel hunting in a rough terrain: learning and discriminating native energy funnels.
Protein folding and binding is commonly depicted as a search for the minimum energy conformation. Modeling of protein complex structures by RosettaDock often results in a set of low-energy conformations near the native structure. Ensembles of low-energy conformations can appear, however, in other regions, especially when backbone movements occur upon binding. What then characterizes the energy ...
متن کاملNeural Network Ensembles from Training Set Expansions
In this work we propose a new method to create neural network ensembles. Our methodology develops over the conventional technique of bagging, where multiple classifiers are trained using a single training data set by generating multiple bootstrap samples from the training data. We propose a new method of sampling using the k-nearest neighbor density estimates. Our sampling technique gives rise ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 24 22 شماره
صفحات -
تاریخ انتشار 2008